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Abstract 

^ Characterization of the rate-equivocation region of a general wiretap channel in- 

O . volves two auxiliary random variables: U, for rate splitting and V, for channel prefixing. 

Evaluation of regions involving auxiliary random variables is generally difficult. In this 
^ ■ paper, we explore specific classes of wiretap channels for which the expression and 

evaluation of the rate-equivocation region are simpler. In particular, we show that 

■ when the main channel is more capable than the eavesdropping channel, V = X is 

■ optimal and the boundary of the rate-equivocation region can be achieved by vary- 
O ' 

ing U alone. Conversely, we show under a mild condition that if the main receiver 
is not more capable, then V = X is strictly suboptimal. Next, we focus on the class 
of cyclic shift symmetric wiretap channels. We explicitly determine the optimal se- 
^ ■ lections of rate splitting U and channel prefixing V that achieve the boundary of the 

' rate-equivocation region. We show that optimal U and V are determined via cyclic 

shifts of the solution of an auxiliary optimization problem that involves only one aux- 
iliary random variable. In addition, we provide a sufficient condition for cyclic shift 
symmetric wiretap channels to have [/ = (/> as an optimal selection. Finally, we ap- 
ply our results to the binary-input cyclic shift symmetric wiretap channels. We solve 
the corresponding constrained optimization problem by inspecting each point of the 
I{X;Y) — I{X; Z) function. We thoroughly characterize the rate-equivocation regions 
of the BSC-BEC and BEC-BSC wiretap channels. In particular, we find that U = (j) 
is optimal and the boundary of the rate-equivocation region is achieved by varying V 
alone for the BSC-BEC wiretap channel. 



*This work was supported by NSF Grants CCF 07-29127, CNS 09-64632, CCF 09-64645, CCF 10-18185 
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1 Introduction 



We consider the discrete memoryless wiretap channel shown in Fig. [H The capacity region of 
this channel is characterized by the rate, i?, between the legitimate users Alice and Bob, and 
the equivocation, i?e, at the eavesdropper Eve. Wyner [1] characterized the rate-equivocation 
region when the received signal at Eve is a degraded version of the signal received at Bob. 
Csiszar and Korner [2] characterized the rate-equivocation region for general, not necessarily 
degraded, wiretap channels. 

Csiszar and Korner's characterization involves two auxiliary random variables: U, for rate 
splitting, and V, for channel prefixing. Evaluation of capacity regions involving auxiliary 
random variables is generally difficult, and it is desirable to determine cases where the 
auxiliary random variables are either not needed or their optimal selection is simplified. For 
the wiretap channel, under certain conditions, it is known that the use of one or both of these 
auxiliary random variables is unnecessary. For instance, if the wiretap channel is degraded, 
neither rate splitting nor channel prefixing is necessary, i.e., the selection [7 = and V = X 
is optimal, for the entire rate-equivocation region [1]. In fact, the same conclusion holds 
if the wiretap channel is less noisy [21 Theorem 3]. For general wiretap channels, for the 
purposes of characterizing the secrecy capacity, i.e., the largest equivocation, rate splitting is 
unnecessary, i.e., U = (p is optimal [2]; further, if the wiretap channel is more capable, then 
channel prefixing as well is unnecessary, i.e., U = (p and V = X are optimal [2]. 

In this paper, we explore specific classes of wiretap channels for which calculation of the 
optimal rate splitting and/or channel prefixing parameters is simpler. The inclusion relations 
among the classes of wiretap channels considered in this paper are shown in Fig. [2l First, 
we show that if the wiretap channel is more capable, then channel prefixing is unnecessary; 
that is, the rate-equivocation region can be characterized by rate splitting, i.e., ^ = X is 
optimal and the boundary of the rate-equivocation region can be traced with optimal {U, X) 
only. Conversely, we prove under a mild condition that, if the channel is not more capable, 
then channel prefixing is strictly necessary, i.e., V X is strictly needed. 

Next, we study the class of cyclic shift symmetric wiretap channels. We explicitly deter- 
mine the optimal selection of U and V that achieve the boundary of the rate-equivocation 
region. In particular, optimal U and V are expressed in terms of the cyclic shifts of the 
solution of an auxiliary optimization problem that involves only one auxiliary random vari- 
able. This is a considerable reduction in the computation requirement for the calculation 
of (the boundary of) the rate-equivocation region. We provide the cardinality bound on 
this single auxiliary random variable appearing in the optimization problem. Then, we 
formulate the problem as a constrained optimization problem. We provide a sufficient con- 
dition under which rate splitting is unnecessary, i.e., U = (p is optimal and the boundary of 
the rate-equivocation region is obtained by varying V alone. In particular, we show that if 
I{X; Y) — I{X; Z) is maximized at the uniform distribution, i.e., if the channel is dominantly 
cyclic shift symmetric, then this sufficient condition is satisfied and hence rate splitting is 
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Figure 1: The wiretap channel. 



unnecessary. Moreover, we show that if the main channel is more capable and both channels 
are cyclic shift symmetric, then {Cb.Cs) rate-equivocation pair is achievable. We also dis- 
cuss an extension of the notion of cyclic shift symmetry for continuous alphabets. Finally, 
we apply our results to the binary-input cyclic shift symmetric wiretap channels. We inves- 
tigate two examples that illustrate the considered cases: BSC-BEC and BEC-BSC wiretap 
channels. We provide full characterizations for the rate-equivocation regions of the BSC- 
BEC and BEC-BSC wiretap channels. We identify the conditions on the BSC cross-over 
probability and the BEC erasure probability that guarantee that channel prefixing or rate 
splitting or both are unnecessary. In particular, we find that rate splitting is never necessary 
for the BSC-BEC wiretap channel. 



2 Model and Background 

As in Fig.lll Alice communicates with Bob in the presence of an eavesdropper. Eve. The input 
and output alphabets, X , y and are finite. The main channel is characterized by p{y\x) 
and has capacity Cb = niaxp^ I{X] Y). Similarly the wiretapper channel is characterized by 
p{z\x) and has capacity Ce = niaxp^ I{,X; Z). W represents the message to be sent to Bob 
and kept secret from Eve with G W = {1, . . . , 2"^}. Alice uses an encoder y9 : W — )■ 
to map each message to a channel input of length n. Bob uses a decoder : — )■ W. The 
probability of error is: Pg = Pr 7^ W]. The rate R is achievable with equivocation 

i?e, if -Pe — ^ as 72 — )■ 00, and 

i?e = lim -H{W\Z'') (1) 

Perfect secrecjil] is achieved if ^I{W] Z^) — )■ and the secrecy capacity Cs is the highest 
achievable perfectly secure rate R. The maximum possible equivocation is also Cs- 

"'^We use the weak secrecy notion. However, for discrete wiretap channels weak and strong secrecy are 
equivalent [3l|4]. 
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Figure 2: Inclusion relations among the classes of wiretap channels. 



The input distribution belongs to the \X\ dimensional probability simplex denoted as 



\x\ 



^Pi = l, pi> 0, \fi 



(2) 



i=l 



Throughout the paper, /^(.) denotes the following function of the input distribution 



f,{P,) = {^i + l)I{X-Y)-I{X-Z) 



(3) 



where /x > is an arbitrary parameter. We denote /o(.) simply as /(.). Note that /^(.) is 
continuous and differentiable for all /i > 0. 

Csiszar and Korner [2] characterized the entire rate-equivocation region as stated in the 
following theorem. 

Theorem 1 (|[2l Corollary 2]) (i?, i?e) pair is in the rate- equivocation region if and only 
if there exist U^V^X^Y,Z such that I{U; Y) < I{U; Z), and 



0<Re< I{V] Y\U) - I{V; Z\U) 
Re<R< I{V;Y) 



(4) 
(5) 



Further, the secrecy capacity is 



Cs = max I(V; Y) - I(V; Z) 



(6) 
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Finally, the cardinality bounds on the alphabets of the auxiliary random variables are 



\U\<\X\+3 (7) 
\V\ <\X\'^ + A\X\ + 3 (8) 

The rate-equivocation region of a wiretap channel is a convex region. Therefore, the 
upper right boundary is traced by solving the following optimization problem for all /i > 
as in Fig. |3l 

max ^^I{V;Y) + I{V;Y\U)-I{V;Z\U) (9) 

Note that this optimization problem is computable due to the bounds on the sizes of U and 
in ([7]) and ([8]) in Theorem [1] In the sequel, we refer to the solution of the optimization 
problem in (jO]) as the optimal selections U*, V* and X*. These optimal selections depend 
implicitly on the value of /i. The optimal value of the objective function in (jO]) at /i = 
is the secrecy capacity Cg. In this case, U is unnecessary, and in fact, we get ([6]) |2]. Note 
that the bounds on the cardinalities of U and V in ©-dH]) in Theorem [1] are valid in general. 
However, the specific cardinality bound on V for the optimization problem in (jH]) when /x = 0, 
or equivalently the problem in (E]), is 

|V| < \X\ (10) 

In order to prove the cardinality bound in ( fTOj) . given V X Y, Z with PMFs p{v) and 
p{x\v), we fix the following \X\ continuous functions of p{x\v): 

( / I NX \ Px\v{i\v), j = -1 

[ /(X; Y\V = f ) - /(X; Z\V = v), ] = \X\ 

From Lemma 3 in [5] and the strengthened Caretheodory theorem of Fenchel-Eggleston 
in [6], we can find another random variable V' with cardinality at most \X\ such that 
V ^ X ^Y,Z and /(X; Y\V) - I{X; Z\V) = I{X; Y\V') - I{X; Z\V') as well as p{x) = 
J Px\vix\''j)dF{v) = X]t)' (^l'^')p('^') for X = 1, . . . , \X\ — 1; see also Appendix C in [7]. 
Since 

I{V- Y) - I{V- Z) = /(X; Y) - /(X; Z) - [/(X; Y\V) - /(X; Z\V)] (12) 

we conclude that liV; Y) - liV; Z) = I{V'; Y) - I{V'; Z). Therefore, |V| < \X\ cardinality 
is sufficient to solve the optimization problem in (|6]). 
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Figure 3: Characterization of the upper right boundary of the rate-equivocation region. 

3 More Capable Wiretap Channels 

More capable condition is a partial ordering for discrete memoryless channels as formally 
defined below. 

Definition 1 ([2j) p{y\x) is more capable than p{z\x) if f{Px) > for all G A. 

A wiretap channel is more capable if the main channel is more capable than the eavesdropping 
channel. 

In p] Theorem 3] , Csiszar and Korner observe that if the wiretap channel is more capable, 
then channel prefixing is unnecessary, i.e., = X is optimal, for achieving the secrecy 
capacity. We will strengthen this result. We will prove that if the wiretap channel is more 
capable, then channel prefixing is unnecessary for achieving the entire boundary of the rate- 
equivocation region. Conversely, we will prove, under a mild condition, that if the wiretap 
channel is not more capable, then V = X is strictly suboptimal, i.e., there exists V ^ X 
that improves the rate-equivocation region compared to V = X. 

Let ej denote the elementary PMF where all the mass is concentrated in the jth coor- 
dinate, i.e., its jth entry is 1 and all other entries are zero. Note that e^, j = 1, . . . , 
form the canonical basis for the \X\ dimensional Euclidean space, and in particular, A is the 
convex hull of e^-, j = 1, . . . , lA"!. An important topological property of A is stated in the 
next lemma, namely a point in the simplex A partitions the simplex into \X\ sub-simplexes 
in a specific way. 

Lemma 1 Let p and p' be two PMFs in A. There exists a PMF q and an index set 
J G {!,..., with \ J\ = \X\ — 1 such that 



where ji G J for i = 1, . . . , \X\ — 1. In particular, qi > if p' is an interior point of A. 




(13) 



i=l 
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Proof: The probability simplex A has comer points ej, j = 1, . . . Given p G A, we can 
find a triangulation [8] {'Di}\J^ of {ei, . . . , ei^^i, p} by combining \X\ — 1 of the corner points 
and p. Then, we get A = IJ!=i where Vi is the convex hull of [{ei, . . . , eix\} \ {^i}] U {p}, 
i = 1, . . . Ifp has a zero entry, then some Vi has smaller dimensionality, however this 
does not violate the generality. Hence, a given PMF p' resides inside one of V^. Moreover, 
if p' has all non-zero entries, then it is not in the convex hull of any proper subset of e^, 
j = 1, . . . , Hence, gi > in this case. ■ 

Lemma [1] says that a point in A partitions it into \X\ sub-simplexes which are convex 
hulls of I A"! — 1 of the vertices and the point itself. We illustrate this partitioning for 
lA"! = 3 in Fig. |H As a consequence, any PMF can be expressed as a convex combination 
of any other PMF and \X\ — 1 of the \X\ canonical PMFs e^. In fact, this partition and 
hence the representation in (JT3l) is unique. Only the existence of such a representation is 
sufficient for our arguments in this paper. In particular, we use this existence result to prove 
the main theorem of this section which is stated next. The proof of this theorem is provided 
in Appendix El 

Theorem 2 // the wiretap channel is more capable, V* = X is optimal for the entire bound- 
ary of the rate- equivocation region and the cardinality bound on U* is 

\U*\ < \X\ (14) 

Conversely, if the wiretap channel is not more capable, V = X is strictly suboptimal provided 
that f{Px) is maximized at an interior point of A. 

As a result, if the wiretap channel is more capable, channel prefixing is not necessary and 
hence the computation of the rate-equivocation region is considerably simplified. Moreover, 
the bound on the necessary rate splitting reduces by 3 compared to Csiszar and Korner's 
bound (from | ^¥1+3 to Another remark is that the direct part in Theorem[2]immediately 
extends for continuous alphabet wiretap channels. However, the converse part does not 
immediately extend as the proof is not valid for infinite dimensional spaces of probability 
density functions. 

We next review less noisy channels for future reference. Less noisy condition is a stronger 
partial ordering than more capable condition. 

Definition 2 ([2]) piy\x) is less noisy than p{z\x) if I{U;Y) > I{U;Z) for all U — )■ X — )■ 
Y,Z. 

A wiretap channel is less noisy if the main channel is less noisy than the eavesdropping 
channel. If a wiretap channel is less noisy (regions Q and (2) in Fig. [2]), neither rate splitting 
nor channel prefixing is necessary for the entire rate-equivocation region pi Theorem 3]. 



7 




Figure 4: Partitioning the probability simplex A. 

4 Cyclic Shift Symmetric Wiretap Channels 

In this section, we focus on cyclic shift symmetric channels. 

Definition 3 ([9]) p{y\x) is cyclic shift symmetric if I{X; Y) is invariant under cyclic shifts 
of the input distribution. 

Cyclic shift symmetric channels are an important class that includes binary symmetric, 
binary erasure, and type-writer channels. A key property of cyclic shift symmetric channels is 
that the input distribution that maximizes the mutual information is the uniform distribution 
P, Theorem 2]. A wiretap channel is cyclic shift symmetric if both the main channel and 
the eavesdropping channel are cyclic shift symmetric. 

In the following theorem, we determine the structure of the optimal auxiliary random 
variables U* and V* as well as the channel input X* for cyclic shift symmetric wiretap 
channels. Remarkably, the optimizing rate splitting U* and channel prefixing V* parameters 
can be determined by solving an auxiliary optimization problem over only one auxiliary 
random variable. In addition, the cardinality bounds on U* and V* are reduced to lA:*! and 
respectively, compared to the general case in ([7]) and (|8]). We provide the proof of this 
theorem in Appendix |Bl 

Theorem 3 In a cyclic shift symmetric wiretap channel, the optimal selection of the auxil- 
iary random variables U* and V* in ^ have the cardinalities \U*\ < \X\ and \V*\ < 
respectively, with the following structure: 

p{U*=u) = j^^, ue{l,...,\X\} (15) 

p{V* = {u-l)\X\+v\U* =u) =p{V = v), u,v e {1,...,\X\} (16) 

p{V* =v\U* =u) = 0, ue {l,...,\X\},v ^ {{u-l)\X\ + l,...,u\X\} 

(17) 

p{x\V* = {u-l)\X\+v) =p{x\V = v){u-l), u,v,x e {1,...,\X\} (18) 
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where p{X = x\V = v){u — 1) denotes the u — 1st cyclic shift of the distribution p{x\V = v). 
Moreover, the distributions p{V = v) and p{X = x\V = v) with \V\ < \X\ are the optimizers 
of the following auxiliary optimization problem: 

max (l{X;Y)-I{X;Z)-\{fi + l)I{X;Y\V)-IiX;Z\V)\] (19) 

V^X^Y,Z \ L -I / 

where (x)^ = max{0,x}. 

We illustrate the specific structure of the optimal auxiliary random variables and the 
channel input in Fig. [5l In particular, each element of U* generates the optimizing PMF p{V) 
over I A" I elements of V*. The first \X\ elements of V* generate the optimizing conditional 
PMF p{X\V = v) over X. The remaining elements of V* generate cyclic shifts of p{X\V = v) 
over X. An equivalent representation for the optimal selections can be obtained by letting 
V* = {V*,V*) with |Vi*| = IV2I = 

p(f/*=n) = ^, ue{l,...,\X\} (20) 

p{V* = {vi,V2)\U* = u)=p{V = vi)5iv2-u), u,vuV2e{l,...,\X\} (21) 
pix\V* = (i;i, V2)) = pix\V = vi)iv2 - 1), vi, V2e{l,..., (22) 

Note that U* is a deterministic function of V* as stated in [21 Theorem 1]. This is verified 
easily from the equivalent representation in (120|) - (!22|) . Given V* = {¥{ = fi,^* = V2), 
U* = V2 with probability 1. However, V* is a stochastic function of U*. These can also be 
verified from Fig. |5l 

The optimization problem in ( IT9l) is a constrained optimization problem over lAfp — 1 vari- 
ables: 1^*1 probability distributions on X, p{X = x\V = Vi). Each probability distribution 
accounts for lA:*! — 1 variables for i = 1, . . . ,\X\. In addition, the distribution for V accounts 
for 1^*1 — 1 variables. Let us define Aj = p(y = Vi) and Pi^P2^ ■ ■ = p{X = x\V = Vi) . 

We have Aj > 0, p^*'' > and Aj = 1, Xlj^i'^ — ^- The following is a restatement of the 
constrained optimization problem in (fT9l) : 



s.t. E^^ = i'Epf = i 



Xi > 0, > (23) 

Note that the cyclic shift symmetry assumption on Bob's and Eve's channels yields a 
significant reduction in the cardinalities of the auxiliary random variables. In particular, the 
bound on the rate splitting variable reduces from lA"! + 3 to 1^*1 and the bound on the channel 



9 



V* 



Figure 5: The structure of the optimal U* ^ V* ^ X for cychc shift symmetric wire- 
tap channels. piV) and p{X\V = v), v E {I, . . . , are the solutions of the auxiliary 
optimization problem in ( !T9|) . 

prefixing variable reduces from |A:'p + 4|A'|+3to l^l"^. In fact, the problem in f fT9|) for /i = 
is equivalent to finding the secrecy capacity Cg- Thus, in cyclic shift symmetric wiretap 
channels, solving a problem of the same number of variables as finding the secrecy capacity 
is sufficient to characterize the optimal selections of U and V for any point on the boundary of 
the rate-equivocation region. Another remark is that the constrained optimization problem 
in ( 123|1 for yU = is equivalent to finding the secrecy capacity for general wiretap channels 
not necessarily cyclic shift symmetric. 

The structure of the optimal auxiliary selections U* and V* for cyclic shift symmetric 
wiretap channels in Theorem |3] indicates a sufficient condition for f/ = to be an optimal 
selection: If the optimizing p{V = Vk) and p{X = x\V = Vk), k = 1, . . . ,\X\ in f lT9|) are such 
that 

p{V = Vk) = T^,, and p{X = x\V = Vk)=p{X = x\V = v^){k-l), Wk (24) 
I I 

then rate splitting is not necessary. In this V has cardinality each element of 

V generates a cyclic shift of p(X|V^ = vi), and a uniform distribution is generated over X. 
Therefore, the uniform V together with the cyclic prefix channel V ^ X in ([21]) maximize 
I{X; Y) and hence the objective function in (Q (c.f. the proof of Theorem [3] in Appendix [B|) . 
In other words, if is satisfied, then U* and V* as selected in (fT5|) - (fT8|) yield a uniform 
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PMF for p{X\U* = u) ioT all u e {1, . . . i.e., U* is independent of X. Therefore, if 

(!24|) is satisfied, U = (j) can be selected without losing optimahty, i.e., U is not necessary. 

Next, we consider a sub-class of cyclic shift symmetric channels, namely dominantly 
cyclic shift symmetric channels (c.f. \TU\ Definition 5]): 

Definition 4 A cyclic shift symmetric wiretap channel is dominantly cyclic shift symmetric 
z//(u) > f{Px), VP^ G A, where u is the \X\ dimensional uniform distribution. 

Note that from [TTl Theorem 3] and the fact that the uniform distribution is capacity achiev- 
ing for cyclic shift symmetric channels, a less noisy cyclic shift symmetric wiretap channel 
is also dominantly cyclic shift symmetric (see also |12])- We denote dominantly cyclic shift 
symmetric channels by the shaded region in Fig. [2l Note that all cyclic shift symmetric 
channels in regions Q and (2) are shaded. In the following lemma, we prove the sufficiency 
of dominant cyclic shift symmetry for having the solution of ( fT9i) satisfy the property in ( 12^ . 
We provide the proof of this lemma in Appendix O 

Lemma 2 For dominantly cyclic shift symmetric wiretap channels, a solution of the problem 
in / fTPj) is a distribution piV) and a prefix channel V X that satisfy the condition in [24\ )- 

A corollary of Lemma [2] and the structure in Theorem [3] is that rate splitting is unneces- 
sary for dominantly cyclic shift symmetric channels and the entire rate-equivocation region 
can be attained by channel prefixing alone. 

Corollary 1 In a dominantly cyclic shift symmetric wiretap channel, rate splitting does 
not improve the rate- equivocation region and optimal channel prefixing has the cardinality 
I V* I < In particular, 

Cs = max /(P,) - min /(P,) (25) 

Px 

We remark here that if the wiretap channel is dominantly cyclic symmetric, then known 
inner and outer bounds on the corresponding broadcast channel capacity region are shown 
to coincide in [10]. Therefore, the broadcast channel capacity region, which is in general an 
open problem, can be fully characterized for dominantly cyclic shift symmetric channels. We 
observe here that dominant cyclic symmetry yields a similar simplification for the wiretap 
channel, rendering rate splitting variable U unnecessary. However, note that the class of 
cyclic shift symmetric wiretap channels for which rate splitting is unnecessary is strictly 
larger than the class of dominantly cyclic shift symmetric channels. In fact, for all cyclic 
shift symmetric channels which satisfy (|2^ . f/ = is optimal and dominant cyclic shift 
symmetry is just a sufficient but not necessary condition for the property fl24l) . In Section [5l 
we provide examples for binary-input cyclic shift symmetric wiretap channels that are not 
dominantly cyclic shift symmetric but for which rate splitting is still unnecessary. 
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Note that the secrecy capacity expression in (125|) is the solution of the problem in (fT9l) 
for /i = 0. We also remark that ( 125|) is generally an upper bound for a wiretap channel: 



Cs = max I{V; Y) - I{V; Z) 

= max [J(X; Y) - /(X; Z)] - [/(X; F |\/) - /(X; 
< max /(P^) - min /(P^.) 



(26) 
(27) 
(28) 



but it is attained for dominantly cyclic shift symmetric wiretap channels by Corollary [H 

Next, we consider more capable cyclic shift symmetric channels. V* = X is optimal 
in this case due to Theorem El hence, the structure in Theorem |3] for general cyclic shift 
symmetric channels is further reduced under the more capable condition. In addition, the 
most componentwise dominant rate-equivocation pair {Cb,Cs) is achieved in more capable 
cyclic shift symmetric channels as we show in the following corollary. Further, if the channel 
is more capable and dominantly cyclic shift symmetric, {Cb, Cg) pair is achieved hj V* = X 
and U* = (f). We provide the proof of this corollary in Appendix [Dl 

Corollary 2 In a more capable cyclic shift symmetric wiretap channel, V* = X and the rate- 
equivocation pair {Cb, Cg) is achievable. In a more capable dominantly cyclic shift symmetric 
wiretap channel, {Cb,Cs) pair is achieved by V* = X and U* = 0. 

We remark here that Theorem [3] applies to more capable cyclic shift symmetric wiretap 
channels as well. Even though V^* = X is stated as optimal in Corollary [21 the general 
structure of U* and V* in Theorem [3] does not reduce to V* = X when more capable 
condition is imposed. Note that the cardinality of V* in Theorem [3] is lAfp while V* = X 
requires only a cardinality of \X\ for V*. However, {Cb,Cs) pair is achieved with both 
auxiliary selections. Therefore, U* and V* in Theorem [3] are not necessarily unique optimal 
auxiliary selections. 

More capable cyclic shift symmetric wiretap channels have already been covered in ^1] 
in the following example: 



p{y\x) 



2 



1 



/ 1 — p p 1 — q q \ 

p 1 — p q i — q 

1 — q q 1 ~ p p 

\ q ^ — q p 1 ~ p / 



and 



/ 1 — r 1 — r 



r 



r \ 



p{z\x) 



1 



1 — r 1 — r 



r 



r 



2 



r 



r 



1 — r 1 — r 



\ r 



r 1 — r 1 — r / 
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In [TT], it is shown that, for r close enough to 1/2 (depending on the values of p and g), the 
wiretap channel is more capable. However, in the same reference, the channel is shown to 
be not less noisy for any r, p and q. We now observe that p{y\x) and p{z\x) are cyclic shift 
symmetric channels. Therefore, by Corollary El {Cb,Cs) pair is achievable by a non-trivial 
U* with uniform distribution and V* = X when r, p and q are such that the wiretap channel 
is more capable. 

We note that the results obtained for discrete alphabet cyclic shift symmetric channels 
naturally extend if the alphabets are bounded continuous intervals. In particular, the def- 
inition of cyclic shift symmetry extends naturally for X = [0,6): If I{X;Y) is invariant 
under any modular shift in the input PDF, the channel is cyclic shift symmetric. Typical 
examples of continuous alphabet cyclic shift symmetric channels are modulo additive noise 
channels [I3]. If cyclic shift symmetry holds, the channel capacity is achieved at uniform 
distribution over X. Hence, if both the main and eavesdropping channels are cyclic shift 
symmetric, then the optimal selections U* and V* have the same structure as in Theorem 
131 The definition of dominant cyclic shift symmetry also extends similarly for continuous 
alphabets and rate splitting is not necessary for continuous alphabet dominantly cyclic shift 
symmetric wiretap channels. 

The result does not directly extend for unbounded input alphabets, i.e., for b = oo, 
with an average power constraint. Even if the cyclic shift symmetry holds, it may not be 
possible to generate Bob's capacity achieving input PDF by shifting the solution of the 
auxiliary optimization problem and therefore the proof method in Theorem |3] is not directly 
applicable. 

5 Binary-Input Cyclic Shift Symmetric Wiretap Chan- 
nels 

In this section, we consider cyclic shift symmetric wiretap channels with binary input: |Af | = 
2. Note that the cardinality requirement on V to solve the problem in (|T9l) is |V| = 2 for 
binary input wiretap channels. Let p{vi) = A, p{x\vi) = [pi, 1— Pi] and p{x\v2) = [p2, 1— ^2]- 
Let the resulting input distribution be Px = [px, ^ — Px]- The optimization problem in f|T9|) 
and ( l23l) for the binary-input case reduces to: 

max /(Api + (1 - \)p2) - XfM " (1 " >^)fM 
s.t. < \,pi,P2 < 1 (29) 

A geometrical visualization for the problem in ( l29l) is provided in Fig. El Two points are 
picked from the x-axis and their image on are combined to form a line. A determines the 
point of operation and the value of the objective function is the difference between / and 
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Figure 6: Visualization of the optimization problem in terms of pi, p2 and A. 

the formed line segment at that particular point. 

The necessary optimality conditions for the problem in fl29|) are found by taking the 
derivative of the objective function with respect to pi,p2 and A, respectively. If pl,P2, A* are 
strictly interior to the [0, 1] interval, i.e., not equal to or 1, then 



A* {f'ix*pi + (1 - y)p;) 
(1 - A*) {f'ix*pi + (1 - y)p*2) 
{pl-p;)f{ypl + ii-\np;)-ifM) 



- f'M)) = 

- f'M)) = 

- fM)) = 



(30) 
(31) 
(32) 



Note that A* 7^ 0, 1 as the objective function in f l29|) takes the value zero for A = 0, 1. Hence, 
the optimality condition in fl32|) always holds and we get: 



f'ixYi + (1 - x*)p;) 



fM) - fM 

p\-p*2 



(33) 



If, in addition, p^,p2 7^ 0, 1, 



nx*Pi + {I - x*)Pi) = fM) = f'M) 



(34) 



If = 0, 1 and p*2 ^ 0, 1, then 



f'{\*Pl + {i-\*)pl) = f'M) 



(35) 
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and similarly if = 0, 1 and j9* 7^ 0, 1, then 



f'ix*pl + (1 - \*)p;) = fiipi) 



(36) 



The conditions in (!33!) - (!36l) have the following geometric interpretation: In Fig. |6l the line 
drawn from (p*, /^(p*)) and (pg, ffiiP^)) must be tangent to the curve at both points. If pi 
or P2 are or 1, then this tangency does not have to hold at that point. We illustrate these 
conditions in Fig. [71 We observe that for the selections of pi and p2 in the configurations 
(a) and @, the line is tangent to f^{px) at only one point and the other point is either or 
1. However, (b) and (c) do not satisfy the optimality condition as the pi and p2 points lie 
interior to [0, 1] but the line is not tangent to In fact, we observe by inspection that (a) 
and (d) are the only possible configurations in the particular values chosen in Fig. [7] that 
satisfy the optimality conditions in ( I33|) -(l36l). 

Note that the geometric interpretation of the optimality conditions provide a simple 
check if a point p G (0, 1) is one of the optimal selections pl,P2'- Draw the tangent line for 
ffj, at p. If this tangent line does not intersect other than p or if it intersects at a point 
p' G (0, 1) but it is not tangent at p', then p cannot be an optimal selection. Also note 
that optimality conditions do not rule out the trivial selection pi = and p2 = 1. Hence, 
this selection is always a candidate to be an optimal selection. This selection is indeed 
optimal if f{px) > for all px € [0, 1], i.e., when the wiretap channel is more capable by 
Theorem O Geometrically, when f{px) > the points {pi, ffi{pi)) have nonnegative y-axis 
as f^ipx) > fiPx) > 0. The level of the line segment is the smallest when pi = 0, p2 = 1- 
Moreover, the points \pi + (1 — X)p2 span the space of two-dimensional PMFs when = 
and P2 = 1- Hence, the difference of the function and the line segment has the highest value 
when pi = and p2 = 1, that is, the optimal selection is pi = and p2 = 1- 



Let the main channel be BSC(e) and the eavesdropper's channel be BEC(a;). Note that both 
BSC and BEC are cychc shift symmetric. X = y = {0,1} and Z = {0, e, 1}. For < e < 0.5 
and the input distribution Px = [pxA ~ Px], we have 



where h{.) is the binary entropy function. We first investigate some geometric properties of 
the function f{px) in flSTjl . It can be shown [lOJ that when p{y\x) is BSC(e) and p{z\x) is 



5.1 The BSC-BEC Wiretap Channel 



f{px) = h{{2e - \)px + 1 - e) - h[e) - (1 - a)h[px) 



(37) 



BEC(a): 



1. If a < 4e(l — e), then Eve is less noisy than Bob. 



2. If 4e(l — e) < a < hie), Eve is more capable but not less noisy than Bob. 
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Figure 7: Optimality conditions for pi, p2 and A. @ and (d) satisfy the optimality conditions 
while (§) and (c) do not satisfy the optimahty condition. 

3. If h{e) < a, the wiretap channel is dominantly cyclic shift symmetric. 

An illustration of f{px) function for the BSC-BEC channel is provided in Fig.|8]for e = 0.1 
and various a. Note that for any e and a, f{px) < for some px', thus, the channel is not 
more capable and is always in region @ in Fig. |2i We observe that for a > h{e), f{px) 
is maximized at px = 0.5, i.e., the channel satisfies dominant cyclic shift symmetry. From 
Corollary [H rate splitting is not necessary for a > h{t) and moreover the required channel 
prefixing has |V*| = 2 with p{vi) = p{v2) = 1/2 and p{x\vi) = [a,l — a], p{x\v2) = [1 — a, a] 
where [a, 1 — a] is an input distribution that maximizes [(/i + 1)1 {X; Y) — I{X; Z)]. The 
secrecy capacity is 

Cs = max f{px) - min/(p^) (38) 

Px Px 

We also note that for a < 4e(l — e). Eve is less noisy than Bob, and the secrecy capacity is 
Cs = P]- We investigate the remaining case, which is 4e(l — e) < a < h{e), in the next 
subsection. 

5.1.1 The Case of 4e(l - e) < a < /i(e) 

When 4e(l — e) < « < h{e) in the BSC-BEC channel, neither Eve is less noisy nor the 
dominant cyclic shift symmetry holds. Secrecy capacity is still non-zero in this case and the 
rate-equivocation region has a non-empty interior. One can verify easily by tracing all points 
in [0, 1] that there are only 5 configurations that satisfy the necessary optimality conditions 
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Figure 8: f{px) as a function of px for the BSC-BEC channel. 

as well as the trivial selection V = X, i.e., pi = and p2 = 1. However, the trivial selection 
is immediately eliminated as f]i{px) < for some Px in this case, and hence = X is strictly 
sub optimal by Theorem O 

The other 5 configurations are shown in Fig. [9j In configurations @, ©, © and (d), 
either pi or p2 is on the boundary and in configuration (e), both pi and p2 are in the interior 
with the property pi G argminp^g[o,i] /^(px) and p2 = I — Pi- By comparing these three 
configurations, we observe that the optimum selection is always configuration (e). In other 
words, we have for > and for all < X,Pi,P2 < 1, 

/(0.5) - min Upx) > f{\pi + (1 - \)p2) - XfM) " (1 " >^)UP2) (39) 

Px 

Note that (e) has the desirable property that P2 = 1 — Pi and A* = i. This property 
is equivalent to the one in in the binary-input case. Therefore, f/ = is optimal, 
and the upper right boundary of the rate-equivocation region can be traced by V only. 
However, unlike the case of h{e) < a, if 4e(l — e) < a < h{e), there exists n > such that 
/(0.5) < miup^ ft,{Px)- We define /i* as 

/i* = min{/i : /(0.5) < min/^(p^)} (40) 

Px 

For fi > fi*, V defined as above cannot improve the objective function. Thus, trivial V is 
the optimal selection for /i > yU*. However, the highest achievable equivocation with a trivial 
V selection is zero as Eve's channel is more capable with respect to Bob's channel in this 
case. Hence, for /i > fi*, the only possible achievable point is ((7^,0). The general form of 
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Figure 9: Five configurations that satisfy optimality conditions, (e) is always tlie optimal. 

the rate-equivocation region is given in Fig. [TOl The upper right boundary includes the line 
segment that combines the point for which the supporting hne slope is /x* and the (Cs,0) 
point. This line segment has the slope /x*. 

In conclusion, rate splitting U is not necessary for determining the rate-equivocation 
region of the BSC-BEC wiretap channel and in particular the secrecy capacity is 

C, = /(0.5)-min/(p,.) (41) 

Px 

Note that fHT]) is in agreement with ( 138|) . as in that case maxp^ f{Px) is achieved aX Px = 0.5. 
5.2 The BEC-BSC Wiretap Channel 

Now, let the main channel be BEC(a) and the eavesdropper's channel be BSC(e). X = Z = 
{0, 1} and y = {0, e, 1}. We have the following facts ^U\: 

1. If a < 4e(l — e), then Bob is less noisy than Eve. 

2. If 4e(l — e) < « < h{e), then Bob is more capable but not less noisy than Eve. 

Hence, if 4e(l — e) < « < h{e), the wiretap channel is in region @ in Fig. [2l By [2j, 
Cs = maxp^ f{Px), and from Corollary [2], {Cb,Cs) is achievable. If a < 4e(l — e), as both 
channels are cyclic shift symmetric, by [HI Theorem 3], Cg = Cb — Ce and (Cb^Cs) is 
achievable. We investigate the remaining case, which is a > h{e), in the next subsection. 
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Figure 10: The general form of the rate-equivocation region of the BSC-BEC wiretap channel 
for 4e(l - e) < a < h{e). 

5.2.1 The Case of a > /i(e) 

In the BEC-BSC wiretap channel, if a > h{e), neither less noisy nor more capable condition 
holds. We first solve the optimization problem in (!29l) by inspecting the tangent lines drawn 
at interior points p G (0, 1). One can easily verify that, as in the BSC-BEC channel, there are 
only 5 possible configurations that satisfy the necessary optimality conditions in addition 
to the trivial selection pi = and p2 = 1- As Bob's channel is not more capable with 
respect to Eve in this case and f{px) is maximized at an interior point, by Theorem |2l trivial 
selection is strictly suboptimal in this case. In particular, the trivial selection is not optimal 
for all fi such that f^{pi) < for some pi G [0, 1]. We show the other 5 configurations in 
Fig. HH In configuration ©, pi,p2 e (0,1) with /^(pi) = /^(p2) and f'^{pi) = f'^{p2) = 0. 
Hence, it satisfies the optimality condition for A = 0.5. However, the objective function 
/(0.5) — 0.5 (/^(pi) + ff_i{p2)) < 0; therefore, this configuration cannot be optimal. The 
other configurations (a), (B), (c) and @ have pi or p2 on the boundary of [0, 1] interval as 
shown in Fig. [HI We observe that (a) and (B) achieve the same value of the objective 
function and it is always higher compared to that achieved by (c) and @. Therefore, (a) 
and (B) are optimal selections for the problem in fl29|) . Note that @ is obtained by cyclic 
shifts of (b). The configuration in @ is also represented as p{X = 0\V = Vi) = and 
p{X = 0\V = V2) = Pi where the line segment that combines (0,0) and (pi,/^(pi)) is 
tangent to the curve {px, ff^iPx))- Similarly, (b) is equivalent to p{X = 0\V = fi) = 1 and 
p{X = 0\V = V2) = 1 — pi. The rate equivocation region is traced by varying and finding 
Pi that satisfies the tangency and A* that yields the optimal value of the objective function 
given pi. In particular, we define /x* as 



H* = min{/x > 0| min/^(pa,) > 0} 



(42) 



19 




Px 

Figure 11: Five configurations that satisfy tlie necessary optimality conditions for tlie BEC- 
BSC wiretap cfiannel. (a) and (B) are botli optimal. 

For /i < /i*, we use the following U and V: 



p{U = ui) =p{U = U2) =0.5 (43) 

p{V = vi\U = ui) = X\ p{V = V2\U = ui) = l-X*, (44) 

p{V = V3\U = U2) = \*, p{V = V4\U = U2) = l-\*, (45) 

p(X = 0\V = vi) = 0, p{X = 0\V = V2) = Pi, (46) 

p(^X = 0\V = V3) = 1, p{X = \V = v^) = l-p, (47) 

For /i > fi*, V is not necessary as f^{px) > in this case. We obtain a case similar to the 
more capable condition and one can easily show that a non-trivial V does not improve the 
objective function. As V is not used for fi > /i*, the achieved rate liV; Y) = /(X; Y) and 



optimal selection of U as in Theorem [3] generates uniform distribution on the channel input 
X, which is capacity achieving for Bob's channel. Hence, for fi > fi*, Cb is achieved. The 
general form of the rate-equivocation region is depicted in Fig. [121 Note that the supporting 
line with slope /i* is on the boundary of the rate-equivocation region. 

5.3 The Existence of More Capable but Not Less Noisy Domi- 
nantly Cyclic Shift Symmetric Wiretap Channels 

For the BSC-BEC and BEC-BSC wiretap channels, we observe that if the channel is more 
capable but not less noisy, dominant cyclic symmetry does not hold. Conversely, if dominant 
cyclic symmetry holds the channel is not more capable. We question whether this property 
extends for general cyclic shift symmetric wiretap channels, i.e., we ask whether there exist 
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Figure 12: General form of the rate-equivocation region of the BEC-BSC wiretap channel 
when a > h{e). 



more capable not less noisy cyclic shift symmetric wiretap channels that satisfy dominant 
cyclic shift symmetry. In this subsection, we answer this question in the affirmative direction. 
We consider the following class of binary-input cyclic shift symmetric wiretap channels: 



p{y\x) 




and 



p{z\x) 



p ~ q q p 
q 1 — p — q p 



Bob's channel p{y\x) is BSC(e). Eve's channel p{z\x) is parameterized by p and q and 
if p = 0, it reduces to BSC(g). Using the same notation = [px,l — Px] and f{px) = 
I{X] Y) — I{X] Z), we have the first and second derivatives of f{px) as 



dpx 



2e)log 



;i -p-2q) log 



(2p^-l)e + l-p, 
Px + {1- 2px)e 

Px{p + 2q - 1) + (1 -j9 - g) 



Px{l -p-2q) + q 



(4J 



(26 - 1) 



{{2px - l)e + 1 - Px) {Px + (1 - 2px)e) 

{l-p-2qf 



+ 



{px{p + 2q -1) + {1 - p - q)) {px{l -p-2q) + q) 



(49) 



0.5. 



Since both p{y\x) and p{z\x) are cyclic shift symmetric, we have -^fipx) = at 
In addition, as the second derivative in fH9|) is quadratic in p^., it can vanish at at most 
two points in the [0, 1] interval. In fact, due to the cyclic symmetry, the second derivative 
vanishes at p* and 1 — p* or it does not vanish. Hence, f{px) can have at most one infiection 
point in the (0, 0.5) interval. 

If f{px) has strictly positive first and second derivatives at px = 0, then it has one 
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inflection point in the (0,0.5) interval. This is due to the fact that -^fiPx) > at = 
while -^fiPx) = at px = 0.5. As -^f{Px) is increasing at px = 0, -^f{px) should finish its 
ascent and start its descent in the interval (0,0.5). As the second derivative is a continuous 
function of px and due to the mean value theorem, it vanishes at a point in the (0, 0.5) 
interval. As the second derivative may vanish at at most one point in the (0, 0.5) interval, 
'^fiPx) > for Px G (0,0.5) and f{px) is maximized at px = 0.5. Hence, dominant cyclic 
shift symmetry holds if f{px) has strictly positive first and second derivatives at Px = 0. We 
next show that there exist parameters e, p and q such that the first and second derivatives 
of f{px) at = are strictly positive. The derivatives at p^ = are expressed as 

^/(PJ|,...0 = (1 - 2f) log (i^) - (1 - p - 2,) log {^—Y^) (50) 

,5,) 

dPx ^(1 ~ ^) [^—P^QJQ 

Note that -^f{Px)\p _q is strictly convex monotone decreasing and ^/(Pa-)|p _o strictly 
concave monotone increasing with e G (0, 0.5) for given p, g G (0, 1) with p+q < 1. Moreover, 
we have 

< (1 - p - 2,) log (LlLZlj (52) 

< (l^iLlM (53) 

< (^-P-^^ni-P) (54) 

{l-p-q)q 

where (l52i) is due to the fact that a;log(l + a;)>Oforx>— 1, (l53l) is due to the inequality 
log(l + x) < a; for a; > and fl5^ is due to > 1- Let c{p, q) = ^^~^i^plg)q~^^ ■ Given p 

and q, the second derivative evaluated at p^, = in flSTj) vanishes at e* G (0, 0.5) 

When e is selected as e*{p,q), the first derivative ^/(Px-)|p =o 




bip, g) = (1 - 2e*(p, q)) log ( ^-P^] - (1 - p - 2g) log ^ ^ ^ ) (56) 



e*{p,q) J \ q 

One can show that b{p, q) is strictly convex individually in p and q and its first derivatives 
with respect to p and q vanish at p + 2g = 1. As h{p^ Q') = when p + 2q = 1 and 6(p, g) ^ oo 
for both p + q = 1 and p + g = 0, we have 6(p, g) > for p + 2q ^ 1. Therefore, for given p 
and g with p + 2g 7^ 1, if e is chosen as e*(p, g), f{px) has positive first derivative at = 
and second derivative at p^, = as zero. Since the second derivative is continuous monotone 
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increasing and the first derivative is continuous monotone decreasing in e given p and g, there 
exists 5 > such that if e is chosen as e*(p, q) + 5, both derivatives are positive. Therefore, for 
given p and q with p+2q ^ 1, there exists an interval 5* = (e*(p, g), e*(p, q) + 5) C [0, 0.5] such 
that if e G 5, the resulting wiretap channel is more capable, not less noisy and dominant 
cyclic shift symmetric. We illustrate f{px) for such a wiretap channel in Fig. [131 The 
parameters are e = 0.4202, p = 0.6 and q = 0.25 for this figure. For p = 0.6 and q = 0.25, 
e*{p, q) = 0.4194; hence 6 > 0.0008 in this case. Note from Fig. [13] that the first and second 
derivatives of f{px) are positive at px = as it is monotone increasing and convex in a 
neighborhood of px = 0. Then, the inflection point is observed and the function becomes 
concave driving the first derivative to zero at Px = 0.5. 

In conclusion, we show that the intersection of the shaded region and region (3) in Fig. [2] 
is non-empty and for these channels neither rate splitting nor channel prefixing is necessarjQ. 



6 Conclusions 

In this paper, we provided new results on the computation of optimal rate splitting and chan- 
nel prefixing auxiliary random variables in evaluating the boundary of the rate-equivocation 
region of the discrete memoryless wiretap channel. We proved that if the wiretap channel 
is more capable, then channel prefixing is unnecessary and the entire boundary of the rate- 
equivocation region is traced by rate splitting alone. Conversely, if the channel is not more 
capable, we proved under a mild condition that a non-trivial channel prefixing is strictly 
necessary. Next, we focused on cyclic shift symmetric wiretap channels and explicitly deter- 
mined the optimal rate splitting and channel prefixing variables. We showed that in cyclic 
shift symmetric wiretap channels, it suffices to solve an optimization problem only over one 
auxiliary random variable. We explicitly characterized the optimal rate splitting and chan- 
nel prefixing random variables in terms of the solution of this single variable optimization 
problem. Next, we provided a sufficient condition for rate splitting to be unnecessary for 
achieving the boundary of the rate-equivocation region in cyclic shift symmetric wiretap 
channels; we showed that when /(X; Y) — I{X; Z) is maximized at the uniform input dis- 
tribution, i.e., when the wiretap channel is dominantly cyclic shift symmetric, then rate 
splitting is unnecessary. We also showed that when the wiretap channel is more capable and 
cyclic shift symmetric, {Cb,Cs) rate-equivocation pair is achieveable. Finally, we applied 
our results to binary-input cyclic shift symmetric wiretap channels and characterized the 
boundaries of the rate-equivocation regions of the BSC-BEC and BEC-BSC wiretap chan- 
nels. We provided full characterizations of the boundaries of the rate-equivocation regions 
for these wiretap channels by using a geometric framework. We found that rate-splitting 
is not necessary for the BSC-BEC wiretap channel. We also showed the existence of more 

^In a conference version of this work [14], we mistakenly claimed that rate splitting is strictly necessary 
for more capable but not less noisy channels. This also disproves [l4l Corollary 1]. 
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Figure 13: f{px) for a more capable but not less noisy dominantly symmetric wiretap channel, 
capable, not less noisy, dominantly cyclic shift symmetric wiretap channels. 



Appendices 



A Proof of Theorem 2 



Assume that p{ii\x) is more capable than p{z\x). For any f/— t-K— t-X— t-F, Z and /i > 0, 
we have 

^^I{y■Y) + i{y-Y\u) - i{y-z\u) 

= fi[I{X; Y) - /(X; Y\V)] + /(X; Y\U) - J(X; Z\U) - [J(X; Y\V, U) - J(X; Z\V, U)] 

(57) 

= /i/(X; Y) + /(X; Y\U) - /(X; Z\U) - [(^ + 1)/(X; Y\V) - /(X; (58) 
<^iI{X-Y) + I{X-Y\U)-I{X-Z\U) (59) 

where ( 1571) and ( l58l) follow from the Markov chain f/ — )■ — )■ X — t- y, Z, and ( l59il follows 
from the more capable condition. Therefore, using a non-trivial channel prefixing yields a 
loss in the objective function fiI{V; Y) + liV; Y\U) — I{V; Z\U) and = X is the optimal 
selection. In other words, in order to characterize the entire rate-equivocation region it 
suffices to solve the following optimization problem: 

max ^iI{X-Y) + I{X-Y\U)-I{X-Z\U) (60) 
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We claim that \IA\ < \X\ is sufficient for the solution of ( 160|) . Given f/ — t- X — y, Z, we fix 
the following \X\ continuous functions of Px\u{x\u)- \X\ — 1 components of Px\u{x\u), x = 
1, . . . , \X\ — 1 and I{X] Y\U = u) — I{X] Z\U = u). By Lemma 3 in [5] and the strengthened 
Caretheodory theorem of Fenchel-Eggleston in [6j, there exists a random variable f/' — ?■ X — ?■ 
F, Z such that 

Y) + /(X; Y\U) - /(X; Z\U) = fiI{X; Y) + /(X; Y\U') - /(X; Z\U') (61) 

Therefore, the optimization problem in fl60|) can be solved with the cardinality bound \U\ < 
Note the equivalence of the operations performed for proving the cardinality bounds in 
(EO]) and in (E]). 

To prove the converse statement, assume that p{y\x) is not more capable than p{z\x) 
and f{Px) is maximized at an interior point of A. That is, f{Px) > f{Px), VP^. G A, for 
some P* that has all non-zero entries. Moreover, as the more capable condition does not 
hold, f{Px) < for some input distribution P^. We use and P* to construct a non-trivial 
channel prefixing V such that V ^ X ^ Y, Z and 

fiP:)<IiV;Y)-IiV;Z) (62) 

and hence show the existence of a non-trivial channel prefixing that provides a higher se- 
crecy capacity and therefore a larger rate-equivocation region. Applying Lemma [1] to the 
distributions P^ and P*, there exists a PMF q E A, with gi > such that 

= qiPx + ^^^+1^^- (63) 

k=l 

for some index set J C {!,..., \X\} with \ J\ = \X\ — 1, and jk G J. We construct V with 
|V| = \X\ in the following manner: 

Pv{vk) = qk, k = l,...,\X\ (64) 

In addition, we select px\v{x\vi) = Px, Px\v{x\v2) = e^^, px\v{x\v\x\) = Eval- 
uating Px = '^kJ:iPv{vk)px\v{x\vk), we observe that, by ( l63ll . the constructed Px and the 
maximizer P* are the same. However, /(X; Y\V) — /(X; Z\V) < because given V = Vi, 

J(X; Y\V = V,) - /(X; Z\V = v,) = f{Px) < (65) 

while given V = for k ^ 1, 

/(X; Y\V = Vk) - /(X; Z\V = v^) = (66) 
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As gi > 0, we have 



I{X;Y\V) - I{X;Z\V) <0 (67) 

We take /i = in (158|) and the problem reduces to the calculation of the secrecy capacity. 
Therefore, U = (p is optimal in this case due to [2j. Then, using (167|) in (158|) for /i = gives 

J(X; F) - /(X; Z) < /(V; F) - /(\/; Z) (68) 

which yields (162|) . the desired result, since is P* and the left hand side of (168|) is f{P*)- 



B Proof of Theorem [3 



For given /i > 0, the optimal selections U* and are the solutions of the following opti- 
mization problem: 

max ijI(V;Y) + I(V;Y\U) - I(V;Z\U) (69) 

By using the steps in (j57l)-(j58l). we obtain an equivalent statement for ( l69l) as: 

max ^ M/(X; F) + /(X; F|t/) - /(X; Z\U) - [(/i + 1)/(X; Y\V) - /(X; Z|y)] (70) 

We have the following bound for the objective function in (170|) : 

^J(X; F) + J(X; F |f/) - J(X; Z|f/) - [(/x + 1)J(X; F|\/) - J(X; Z|r)] 
< max/x/(X; F) + max /(X; F|t/) - /(X; Z\U) - [(/x + 1)/(X; Y\V) - /(X; Z\V)] 

(71) 

</i/u(X;F)+ max /(X; F) - /(X; Z) - [(/i + 1)/(X; F|f ) - /(X; Z|y)] (72) 

where u denotes the \X\ dimensional discrete uniform random variable, and /u(X; F) denotes 
the mutual information obtained by choosing the PMF of X as u. In (172|) . we used the fact 
that maxp^ F) = /u(X; F) as Bob's channel is cyclic shift symmetric. Moreover, we 
used the fact that U is not needed, i.e., U = (p, for maximizing /(X; Y\U) — /(X; Z\U) — 
[(/i + 1)/(X; Y\V) - /(X; Z\V)]. Because, for given U ^ V ^ X ^ Y, Z , we can always 
pick Ui eU that maximizes 

max /(X; Y\U = Ui) - /(X; Z\U = Ui) 

- + = v)- i{X] z\v = vMv = v\u = m) (73) 
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and therefore, choose a deterministic U with U = u*, where u* is the argument of the 
maximization in (173|) . Consequently, we have 



= max I{X;Y)-I{X;Z)-[{fi + l)I{X-Y\V)-I{X;Z\V)] (74) 

Note that the right hand side of f l7^ is the claimed auxiliary optimization problem in the 
statement of the theorem. We use V notation to emphasize that the auxiliary random 
variables on the right and left hand sides of (fMl) are different. 

Next, we will show that the bound in fl72|) is satisfied with equality for any cyclic shift 
symmetric wiretap channel. Let V with p{V = v) and p{X\V = v), v E V, he the solution 
of the auxiliary problem in ( FMl) . First, we note that it suffices to consider V such that 
|V| < This follows by the arguments we have used in the previous cardinality bound 
proofs. In particular, given V ^ X ^ Y, Z , we fix \X\ — 1 components of Pj^^y{x\v), 
j = 1,..., I A'l-l, together with + 1)I{X;Y\V = v) - I{X; Z\V = v). By Lemma 3 in [5] 
and the strengthened Caretheodory theorem of Fenchel-Eggleston in 16], the problem in (|74l) 
can be solved with the cardinality bound |V| < \X\. Note the equivalence of the operations 
performed for proving the bounds in this problem and those in (l60l) and ([6]). 

Now, we construct the optimal U*, V* by using the optimal V for the auxiliary problem 
in (17^ as in the statement of the theorem. In particular, we select the cardinalities as 
\U*\ = \X\ and |V*| = with the distributions p{U* = m) = 4^ for u G W* and 



p{V* =v\U* = 1) =p{V = v), V e {!,..., (75) 
p(V*=v\U* = l) = 0, {!,..., (76) 

p(x\V* = v) =p{x\V = v), V e (77) 
p{V* = \X\+v\U* = 2) =p{V = v), V e {!,..., (78) 

p^V* = v\U* = 2) = 0, V ^{\X\ + 1,...,2\X\} (79) 
p{x\V* =\X\+v)=p{x\V = v){l), V e {!,..., (80) 

p{V* = {\X\-l)\X\+v\U* = \X\-1) =p{V = v), V e {!,... (81) 
p{V* = v\U* = = 0, V ^ {{\X\-1)\X\ + 1,...,\X\^} (82) 

p{x\V* = - +v)= p{x\V = v){\X\ - 1) (83) 



The structure of the construction in (!75l) - (l83l) is an expression of the U* ^ V* ^ X* in Fig. 
O Each element of U* generates the optimizing selection piV) for the problem in ([71]) over 
disjoint \X\ elements of V*. Each disjoint \X\ element of V* generates cyclic shifts of the 
optimizing selection p{X\V) for the input X. In fl75|) - fl83|) . we denote the kth cyclic shift 
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of the conditional PMF for the channel input X, p{x\V = v), as p{x\V = v){k). Note that 
the cardinality of is \X\ while that of the optimum V* is [A'p and [A'p conditional input 
PMFs, p{x\V* = v), are obtained by cyclic shifts of [Afl conditional input PMFs, p{x\V = v). 

We first observe that p{x\U* = i) are cyclic shifts of a fixed PMF over X for different 
i. In particular, in the construction in fl75|) - fl83|) . we selected p{x\V* = v) as cychc shifts of 
p{x\V = v) while we kept p{V* = v) the same as p{V = v). Hence, we have 

p{x\U* = i) =pf{x){i-l) (84) 



where Pf{x) = Xlt^'iPl^l^ ~ v)p{V = v). Note that Pf{x) is the maximizing input PMF 



for the auxiliary problem. Therefore, U* and V* generate a uniform PMF for X: 



p{x) = J2piu* = Mx\u* = i) = J2 r^P/(^)(^ = ^^^^ 

i=l i=l ' ' II 

Moreover, by construction of U* and V* and the cyclic shift symmetry of the channels, 
we observe that, for any given i, 

\x\ 

+ 1)J(X; Y\V* = v + iz- 1)\X\)-I{X- Z\V* = v + {i - l)\X\)] 

v=l 

p{y* = {i-l)\X\+v\U = i) 
= + 1)/(X; Y\V = v)- I{X- Z\V = vMV = v) (86) 

v=l 

= {^^+l)I{X■Y\V)-I{X■Z\V) (87) 
Therefore, we have 
(/i + 1)/(X; Y\V*) - I{X; Z\V*) 



+ Y\V* =v)- I{X; Z\V* = v)y{V* = v) (88) 

v=l 

\X\ lA-p 

^ ^ (^(/i + 1)J(X; Y\V* =v)- /(X; Z\V* = v)^p{v\U* = i)p{U* = i) (89) 



1 = 1 v = l 

\X\ \X 



E E ((/^ + Y\V* = v + {i- - /(X; Z\V* = v + {i- l)\X\))p{V 



i=l v=l 



(90) 

= {li + l)I{X-Y\V)-I{X-Z\V) (91) 
where flHT]) is obtained by using f l57|) and the fact that p{v\U* = i) is non-zero only for 
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{i - + l<v< Note that 

/(X; Y\U* = - /(X; Z\U* = = f{pf{x){z - 1)) (92) 

= fiPfix)), (93) 

Hence, given U* = i, we have 

/(X; Y\U* = z) - /(X; Z\U* = z) - [{fi + 1)/(X; Y\V*) - /(X; Z\V*)] 

= f{pf{x)) - [ifx + 1)/(X; Y\V) - /(X; Z\V)] (94) 

As Pf{x) is the maximizing input PMF for the auxihary problem, we have 

/(X; Y\U*) - J(X; Z\U*) - [(^ + 1)J(X; Y\V*) - /(X; Z\V*)] 

= max I{X-Y)-I{X-Z)-[{fi + l)I{X-Y\V)-I{X-Z\V)] (95) 

Since U* and generate a uniform PMF for X by ( !85|) . /(X; y) achieves its maximum, 
as well. Combining this with ( l95l) . we conclude that the constructed ?7* and V^* achieve the 
upper bound in (172|) and hence are optimal. 



C Proof of Lemma [2 

Let the optimal selection of the auxiliary V in the following problem be V* : 

max I{X;Y)-I{X;Z)-[{fi + l)I{X;Y\V)-I{X;Z\V)] (96) 

We will prove that at least one such V* satisfies the following property: 

p{V* = Vk) = T^, and p{X = x\V* = Vk)=piX = x\V* = Vi){k-l), VA; (97) 
I I 

Due to Theorem [31 we already know that the cardinality of V is bounded by 

Let i?* = maxp^ /(Pa;.). First, we obtain an upper bound for the objective function in 
dSSD: 

/(X; Y) - J(X; Z) - [(^ + 1)/(X; Y\V) - J(X; Z\V)] 

<R:- [(/i + 1)/(X; Y\V) - /(X; Z\V)] (98) 
< - min[(/i + 1)/(X; Y) - /(X; Z)] (99) 

where (jSH]) follows from J(X; y) - J(X; Z) < _R* and (jSHD is obtained by replacing [(/i + 
1)/(X; y|V^) — /(X; Z\V)] with its minimum possible value. 
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Now, we will show that the upper bound in (p9|) is achieved by an auxiliary V* of cardinal- 
ity |V| < I'^l with the desired property in (197|) . By the hypothesis, u = argmaxp^ -^(-^; Y) = 
argmaxp^ f{P^). Moreover, let P* = argminpj(/i + Y) - I{X; Z)]. Note that P* 

is different from the uniform distribution. By cyclic shift symmetry, there exist 1^*1 — 1 
other input distributions that minimize (/x + Y) — I{X; Z), which are cyclic shifts of 

P*, denoted by P*{i) for i = 1, . . . — 1. Therefore, we define the channel prefixing V* 
with |V*| = \X\ as piV* = Vi) = with transition probabilities p{x\V* = vi) = P*, 

p{x\V* = V2) = -P^(l), • • • , and p{x\V* = v\x\) = -P^d'^l - !)• Then, the input distribution 
is Px = Yl[=iPiV* = = ^- For this selection of V*, we have 

/(X; Y) - I{X; Z) - [(^ + 1)/(X; Y\V*) - /(X; Z\V*)] 

= f{Px = u) - min[(^ + 1)J(X; Y) - /(X; Z)] (100) 

= Rl - min[(/i + 1)/(X; Y) - /(X; Z)] (101) 

Note that (110 II) is equivalent to the upper bound in fl99l) . Moreover, the specified channel 
prefixing V* satisfies the desired property in ( 1971) by construction. 

D Proof of Corollary [2] 

First, we select V* = X due to Theorem El Next, we note that there exists at least one 
input distribution, denoted by P*, that maximizes f{Px), since it is a bounded continuous 
functional of P^ and the probability simplex A is compact. 

There exist lAfj — 1 other input distributions (cyclic shifts of P*) that achieve the max- 
imum f{Px)- Let us define the auxiliary U, with U = {ui, . . . ,u\x\}, with marginal distri- 
bution pu{ui) = and transition probabilities px\u{x\ui) = P*, px\u{x\u2) = -P^(l), • • • , 
and Px\u{x\u\x\) = -P^d'^l ~ 1)' where -P^(^) denotes the zth cyclic shift of P*. 

Evaluating ([5]) with the specified choice of U* and with V* = X, we have I{V*; Y) = Cb, 
since Px = J2u£U Pui'>^)Px\u{x\u) = J2i=i Pxi^) = where u is the uniform distribution, 
and since Bob's channel is cyclic shift symmetric. On the other hand, evaluating (jll) for 
this specific choice, we get /(X; Y\U) — /(X; Z\U) = Cs, since for any u & U, /(X; Y\U = 
u) — /(X; Z\U = u) = maxp^ f{Px) = Cg. This proves that {Cb, Cs) pair is achievable. 

Note that if P* = u, i.e., if the channel satisfies dominant cyclic shift symmetry, then 
[/* = is optimal since any cyclic shift of u is u itself and thus U* is independent of X. 
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